SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: 



Yanagisawa, Masashi 
Bergsma, Derk 
Wilson, Shelagh 
Brooks, David 
Gellai, Miklos 



(ii) TITLE OF THE INVENTION: NOVEL LIGANDS OF THE NEUROPEPTIDE 

RECEPTOR HFGAN72 

(iii) NUMBER OF SEQUENCES: 21 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham C^o'rporation 

(B) STREET: 7 09 Swedeland RcJ^d 

(C) CITY: King of Prussia 

(D) STATE: PA 



(E) COUNTRY: United Sta 

(F) ZIP: 19406 




f America 



(v) COMPUTER READA^LE'^ FORM : 

(A) MEDIUM TYPE:/i:!U.sk^tte 

(B) COMPU'pfil^: \ IbK q.orfpatible 

(C) OPERATING S^iJ^TEM: DOS 

(D) SOFTteCRE: ^astSEQ for Windows Version 2.0 



(vi) CURRENT APPLICATION DATA: 



/ 



(A) /APPLICATION NUMBER: 08/938,548 
(sf FILING DATE: 26-SEPT-1997 
(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 




(A) APPLICATION NUMBER: 08/887,382 

(B) FILING DATE: 2 -JUL- 199 7 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/820,519 

(B) FILING DATE: 19 -MAR- 19 9 7 

(A) APPLICATION NUMBER: 60/033,604 

(B) FILING DATE: 17-DEC-1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: King, William T. 

(B) REGISTRATION NUMBER: 30,954 

(C) REFERENCE /DOCKET NUMBER: ATG5 0 03 7-2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-5219 

(B) TELEFAX: 610-270-4026 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



AAAACATAAT GTGGGTCTCG CGTCTGCCTC TCTCCCGCCC CTAATTAGCA GCTGCCTCCC 
TCCATATTGT CCCAGGCCAG CGCTTCTTTT GTGCTCCCAG ATTCCTGGGT GCAAGGTGGC 
CTCATTAGTG CCCGGAGACC GCCCCATCTC CAGGGAGCAG ATAGACAGAC AAGGGGGTGA 



60 
120 
180 




TCAGGGGCAC 


AGTGATCCAA 


CCCTGGCCTC 


TGAACGCCGC 


AGCGGCCATT 


CCTTGGGCCC 


240 


AGCCTGGAGA 


CGGCCCCCCT 


GCAGCAGGCT 


AATCTTAGAC 


TTGCCTTTGT 


CTGGCCTGGG 


300 


TGTGGACGCA 


ATGTGCCTGT 


CAATTCCCCG 


CCACCTCAGA 


GCACTATAAA 


CCCCAGACCC 


360 


CTGGGAGTGG 


GTCACAATTG 


ACAGCCTCAA 


GGTTCCTGGC 


TTTTTGAACC 


ACCACAGACA 


420 


TCTCCTTTCC 


CGGCTACCCC 


ACCCTGAGCG 


CCAGACACCA 


TGAACCTTCC 


TTCCACAAAG 


480 


GTAAAGATCC 


AGGGATGGAG 


GGGTGACTCA 


GCCATCCCAG 


AGGAAGCAAA 


AAGAGTGCTT 


540 


GCTCAGAGGG 


CTGGAAGAAA 


GGCCAAAGGT 


GTCTCCACTC 


TTGGTCTTTT 


CCTGGGTGTG 


600 


CTCTGAGGCA 


GGAGCACCTG 


CCTTGGCTCA 


CATTGGGTTG 


GGTGCTGTTT 


TGCTAAGAGC 


660 


CTGTGTTTGC 


TGAGCTCATA 


TGTGTCAGGT 


GCTCCGTTTG 


CACCTGTCAT 


CTCTTGTCAT 


720 


CCTCCCAACA 


GCCTTGCAGA 


GTAGAAATTA 


TTTCTAGTAT 


ACCCAGTTTA 


CAGGTAAGGG 


780 


AGCTGTGCCC 


TCTGAAAGGG 


CAGGAAACTG 


GTTCAAAGCA 


ACGGAGTTCA 


GTCACTCCTG 


840 


CAAGGGGGCA 


GGCAGATGAG 


AGAGCATTCT 


GGAGTCTTGC 


TAGTTCCTGA 


TTTCCATGTG 


900 


TTTCCCTGCT 


GTGGAGAGGA 


AGTTGGGGGG 


ACTCAGTAGG 


GCCCGGGTTT 


TTCCCAAGTT 


960 


TACAACTTCT 


GCTGCAGACA 


GACACTCCTG 


TTTTCAGGTG 


GAGTGGCAAG 


TGCCCTAGTG 


1020 


GTGGCAACAG 


TGGCCTAAGT 


CTCCAGAGAA 


AAGGGGGATT 


CACTCTGCCC 


AGGGGGTCTC 


1080 


AAAAGGCTTC 


CTGTGGGAGA 


TGCTCTGCTG 


GGTCTTGAAG 


GAGGAGCAGG 


GAAAGTAGGC 


1140 


CGATACCAGC 


AAGGGCGCAA 


AGCAAGGAGA 


ACTAAGTGAC 


AGCCAGAAAG 


GAGTGCAGGC 


1200 


TTGGAGGGGG 


CGCGGAGCCA 


GAGGGGCAGG 


TCCTGTGCGT 


GGGAGCTGGT 


GGCGGGCGCC 


1260 


GTGGGAAGAC 


CCCCCCAGCG 


CCCTGTCTCC 


GTCTCCCTAG 


GTCTCCTGGG 


CCGCCGTGAC 


1320 


GCTACTGCTG 


CTGCTGCTGC 


TGCTGCCGCC 


CGCGCTGTTG 


TCGTCCGGGG 


CGGCTGCACA 


1380 


GCCCCTGCCC 


GACTGCTGTC 


GTCAAAAGAC 


TTGCTCTTGC 


CGCCTCTACG 


AGCTGCTGCA 


1440 


CGGCGCGGGC 


AATCACGCGG 


CCGGCATCCT 


CACGCTGGGC 


AAGCGGAGGT 


CCGGGCCCCC 


1500 


GGGCCTCCAG 


GGTCGGCTGC 


AGCGCCTCCT 


GCAGGCCAGC 


GGCAACCACG 


CCGCGGGCAT 


1560 


CCTGACCATG 


GGCCGCCGCG 


CAGGCGCAGA 


GCCAGCGCCG 


CGCCGCTGCC 


TCGGGCGCCG 


1620 


CTGTTCCGCC 


CCGGCCGCCG 


CCTCCGTCGC 


GCCCGGAGGA 


CAGTCCGGGA 


TCTGAGTCGT 


1680 


TCTTCGGGCC 


CTGTCCTGGC 


CCAGGCCTCT 


GCCCTCTGCC 


CACCCAGCGT 


CAGCCCCCAG 


1740 


AAAAAAGGCA 


ATAAAGACGA 


GTCTCCATTC 


GTGTGACTGG 


TCTCTGTTCC 


TGTGCGGTCG 


1800 


CGTCCTGCCC 


ATCCGGGGTG 


GCAAAGCGTC 


TTGCGGAGGA 


CAGCTGGGCC 


TGGAAGCCCG 


1860 


GCTGTCGGGC 


ACCAGCCTTA 


GCTTTTGCGT 


GGTTGAATCG 


GAAACACTCT 


TGGTTGGGGA 


1920 


GTTCCCAGTG 


CAAGGCCCTG 


GGGCACAGAG 


AGAACTGCAC 


AGGTGCATGC 




1970 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 131 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Val Thr Leu Leu 
15 

Ser Gly Ala Ala 
30 

Cys Ser Cys Arg 
45 

Ala Gly lie Leu 

Gin Gly Arg Leu 
80 

Gly lie Leu Thr 
95 

Pro Cys Leu Gly 
110 

Pro Gly Gly Gin 
125 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Met Asn Leu Pro 
1 

Leu Leu Leu Leu 
20 

Ala Gin Pro Leu 
35 

Leu Tyr Glu Leu 
50 

Thr Leu Gly Lys 
65 

Gin Arg Leu Leu 

Met Gly Arg Arg 
100 

Arg Arg Cys Ser 

115 
Ser Gly lie 
130 



Ser Thr Lys Val 
5 

Leu Leu Pro Pro 

Pro Asp Cys Cys 
40 

Leu His Gly Ala 
55 

Arg Arg Ser Gly 
70 

Gin Ala Ser Gly 
85 

Ala Gly Ala Glu 

Ala Pro Ala Ala 
120 



Ser Trp Ala Ala 
10 

Ala Leu Leu Ser 
25 

Arg Gin Lys Thr 

Gly Asn His Ala 
60 

Pro Pro Gly Leu 
75 

Asn His Ala Ala 
90 

Pro Ala Pro Arg 
105 

Ala Ser Val Ala 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 

1 5 10 15 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
20 25 30 

Leu 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Arg Ser Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin 

15 10 15 

Ala Ser Gly Asn His Ala Ala Gly lie Leu Thr Met 
20 25 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 





(ii) 



MOLECULE TYPE: cDNA 



(xi) 



SEQUENCE DESCRIPTION: 



SEQ ID NO: 5: 



GGCTCGGCGG 


CCTCAGACTC 


CTTGGGTATT 


TGGACCACTG 


CACCGAAGAT 


ACCATCTCTC 


CGGATTGCCT 


CTCCCTGAGC 


TCCAGACACC 


ATGAACCTTC 


CTTCTACAAA 


GGTTCCCTGG 


GCCGCCGTGA 


CGCTGCTGCT 


GCTGCTACTG 


CTGCCGCCGG 


CGCTGCTGTC 


GCTTGGGGTG 


GACGCGCAGC 


CTCTGCCCGA 


CTGCTGTCGC 


CAGAAGACGT 


GTTCCTGCCG 


TCTCTACGAA 


CTGTTGCACG 


GAGCTGGCAA 


CCACGCCGCG 


GGCATCCTCA 


CTCTGGGAAA 


GCGGCGACCT 


GGACCCCCAG 


GCCTCCAAGG 


ACGGCTGCAG 


CGCCTCCTTC 


AGGCCAACGG 


TAACCACGCA 


GCTGGCATCC 


TGACCATGGG 


CCGCCGCGCA 


GGCGCAGAGC 


TAGAGCCATA 


TCCCTGCCCT 


GGTCGCCGCT 


GTCCGACTGC 


AACCGCCACC 


GCTTTAGCGC 


CCCGGGGCGG 


ATCCAGAGTC 


TGAACCCGTC 


TTCTATCCCT 


GTCCTAGTCC 


TAACTTTCCC 


CTCTCCTCGC 


CGGTCCCTAG 


GCAATAAAGA 


CGTTTCTCTG 


CTAAAAAAAA 


AAAAAAAAAA 


AAAAA 





(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Asn Leu Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 

15 10 15 

Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 



Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 



Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 



60 
120 
180 
240 
300 
360 
420 
480 
540 
585 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Leu Gly Lys Arg Arg 
65 

Arg Leu Leu Gin Ala 
85 

Gly Arg Arg Ala Gly 
100 

Arg Cys Pro Thr Ala 

115 120 

Arg Val 
130 



Leu Gin Gly Arg Leu Gin 
75 80 
Ala Gly lie Leu Thr Met 
95 

Tyr Pro Cys Pro Gly Arg 
110 

Ala Pro Arg Gly Gly Ser 
125 



Pro Gly Pro Pro Gly 
70 

Asn Gly Asn His Ala 
90 

Ala Glu Leu Glu Pro 
105 

Thr Ala Thr Ala Leu 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Asn Leu Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 

15 10 15 

Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 

20 25 30 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 

15 10 15 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
20 25 30 

Leu 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin Ala 

15 10 15 

Asn Gly Asn His Ala Ala Gly lie Leu Thr Met 
20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Pro Trp Ala Ala Val Thr Leu Leu Leu Leu Leu Leu Leu Pro Pro 

15 10 15 

Ala Leu Leu Ser Leu Gly Val Asp Ala Gin Pro Leu Pro Asp Cys Cys 

20 25 30 

Arg Gin Lys Thr Cys Ser Cys Arg Leu Tyr Glu Leu Leu His Gly Ala 

35 40 45 

Gly Asn His Ala Ala Gly lie Leu Thr Leu Gly Lys Arg Arg Pro Gly 

50 55 60 

Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin Ala Asn Gly 
65 70 75 80 

Asn His Ala Ala Gly lie Leu Thr Met Gly Arg Arg Ala Gly Ala Glu 

85 90 95 

Leu Glu Pro His Pro Cys Ser Gly Arg Gly Cys Pro Thr Val Thr Thr 

100 105 110 

Thr Ala Leu Ala Pro Arg Gly Gly Ser Gly Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 

15 10 15 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin 

1 5 10 15 

Ala Asn Gly Asn His Ala Ala Gly lie Leu Thr Met 
20 25 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CAACCNCTNC CNGACTGCTG 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ATNCCNGCNG CATGATT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GTTGCCAGCT CCGTGCAACA GTTCGTAGAG ACGG 

(2) INFORMATION FOR SEQ ID NO: 16: 

' (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 





(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:' single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18,: 



CGGCAGGAAC ACGTCTTCTG GCG 



23 



TCCTTGGGTA TTTGGACCAC TGCACCGAAG 



30 




ATACCATCTC TCCGGATTGC CTCTCCCTGA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
CCTCTGAAGG TTCCAGAATC GATAGTAN 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 

CCTCTGAAGG TTCCAGAATC GATAG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 577 base pairs 



# 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



CACAATTGAC 


AGCCTCAAGG 


TTCCTGGCTT 


TTTGAACCAC 


CACAGACATC 


TCCTTTCCCG 


60 


GCTACCCCAC 


CCTGAGCGCC 


AGACACCATG 


AACCTTCCTT 


CCACAAAGGT 


CTCCTGGGCC 


120 


GCCGTGACGC 


TACTGCTGCT 


GCTGCTGCTG 


CTGCCGCCCG 


CGCTGTTGTC 


GTCCGGGGCG 


180 


GCTGCACAGC 


CCCTGCCCGA 


CTGCTGTCGT 


CAAAAGACTT 


GCTCTTGCCG 


CCTCTACGAG 


240 


CTGCTGCACG 


GCGCGGGCAA 


TCACGCGGCC 


GGCATCCTCA 


CGCTGGGCAA 


GCGGAGGTCC 


300 


GGGCCCCCGG 


GCCTCCAGGG 


TCGGCTGCAG 


CGCCTCCTGC 


AGGCCAGCGG 


CAACCACGCC 


360 


GCGGGCATCC 


TGACCATGGG 


CCGCCGCGCA 


GGCGCAGAGC 


CAGCGCCGCG 


CCCCTGCCTC 


420 


GGGCGCCGCT 


GTTCCGCCCC 


GGCCGCCGCC 


TCCGTCGCGC 


CCGGAGGACA 


GTCCGGGATC 


480 


TGAGTCGTTC 


TTCGGGCCCT 


GTCCTGGCCC 


AGGCCTCTGC 


CCTCTGCCCA 


CCCAGCGTCA 


540 


GCCCCCAGAA 


AAAAGGCAAT 


AAAGACGAGT 


CTCCATT 






577 



